智能论文笔记

Causality-based Neural Network Repair

Bing Sun , Jun Sun , Hong Long Pham , Jie Shi

分类：人工智能 | 机器学习

2022-04-20

神经网络在广泛的应用中具有明显的成就。广泛的采用也引起了人们对它们的可靠性和可靠性的关注。与传统的决策计划类似，神经网络可以具有需要修复的缺陷。这些缺陷可能会导致不安全的行为，提高安全问题或不公正的社会影响。在这项工作中，我们解决了修复神经网络的问题，以了解公平和缺乏后门等理想特性。目的是构建一个神经网络，该神经网络通过（微小）调整给定神经网络的参数（即权重）来满足该属性。具体来说，我们建议护理（\ textbf {ca}基于用途的\ textbf {re}对），一种基于因果关系的神经网络维修技术，1）执行基于因果关系的故障本地化，以识别“有罪”神经元和2）优化确定的神经元的参数减少了不当行为。我们已经对各种任务进行了经验评估，例如后门去除，神经网络维修的公平性和安全性能。我们的实验结果表明，护理能够有效地修复所有神经网络。对于公平维修任务，Care成功地将公平性提高了61.91美元\％$。对于后门删除任务，CARE将攻击成功率从$ 98 \％$降低到小于$ 1 \％$。对于安全物业维修任务，CARE将财产违规率降低到$ 1 \％$。结果还表明，由于基于因果关系的故障定位，CARE的维修重点关注不当行为并保留神经网络的准确性。

translated by 谷歌翻译

A Robust and Low Complexity Deep Learning Model for Remote Sensing Image Classification

Cam Le , Lam Pham , Nghia NVN , Truong Nguyen , Le Hong Trang

分类：计算机视觉 | 机器学习

2022-11-05

In this paper, we present a robust and low complexity deep learning model for Remote Sensing Image Classification (RSIC), the task of identifying the scene of a remote sensing image. In particular, we firstly evaluate different low complexity and benchmark deep neural networks: MobileNetV1, MobileNetV2, NASNetMobile, and EfficientNetB0, which present the number of trainable parameters lower than 5 Million (M). After indicating best network architecture, we further improve the network performance by applying attention schemes to multiple feature maps extracted from middle layers of the network. To deal with the issue of increasing the model footprint as using attention schemes, we apply the quantization technique to satisfies the number trainable parameter of the model lower than 5 M. By conducting extensive experiments on the benchmark datasets NWPU-RESISC45, we achieve a robust and low-complexity model, which is very competitive to the state-of-the-art systems and potential for real-life applications on edge devices.

translated by 谷歌翻译

Can Current Task-oriented Dialogue Models Automate Real-world Scenarios in the Wild?

Sang-Woo Lee , Sungdong Kim , Donghyeon Ko , Donghoon Ham , Youngki Hong , Shin Ah Oh , Hyunhoon Jung , Wangkyo Jung , Kyunghyun Cho , Donghyun Kwak

分类：自然语言处理

2022-12-20

Task-oriented dialogue (TOD) systems are mainly based on the slot-filling-based TOD (SF-TOD) framework, in which dialogues are broken down into smaller, controllable units (i.e., slots) to fulfill a specific task. A series of approaches based on this framework achieved remarkable success on various TOD benchmarks. However, we argue that the current TOD benchmarks are limited to surrogate real-world scenarios and that the current TOD models are still a long way from unraveling the scenarios. In this position paper, we first identify current status and limitations of SF-TOD systems. After that, we explore the WebTOD framework, the alternative direction for building a scalable TOD system when a web/mobile interface is available. In WebTOD, the dialogue system learns how to understand the web/mobile interface that the human agent interacts with, powered by a large-scale language model.

translated by 谷歌翻译

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

Andrey Ignatov , Radu Timofte , Maurizio Denna , Abdel Younes , Ganzorig Gankhuyag , Jingang Huh , Myeong Kyun Kim , Kihwan Yoon , Hyeon-Cheol Moon , Seungho Lee

分类：计算机视觉

2022-11-07

Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose the participants to design an efficient quantized image super-resolution solution that can demonstrate a real-time performance on mobile NPUs. The participants were provided with the DIV2K dataset and trained INT8 models to do a high-quality 3X image upscaling. The runtime of all models was evaluated on the Synaptics VS680 Smart Home board with a dedicated edge NPU capable of accelerating quantized neural networks. All proposed solutions are fully compatible with the above NPU, demonstrating an up to 60 FPS rate when reconstructing Full HD resolution images. A detailed description of all models developed in the challenge is provided in this paper.

translated by 谷歌翻译

Label driven Knowledge Distillation for Federated Learning with non-IID Data

Minh-Duong Nguyen , Quoc-Viet Pham , Dinh Thai Hoang , Long Tran-Thanh , Diep N. Nguyen , Won-Joo Hwang

分类：机器学习 | 人工智能

2022-09-29

在现实世界应用中，联合学习（FL）遇到了两个挑战：（1）可伸缩性，尤其是应用于大型物联网网络时；（2）如何使用异质数据对环境进行健全。意识到第一个问题，我们旨在设计一个名为Full-Stack FL（F2L）的新型FL框架。更具体地说，F2L使用层次结构架构，使扩展FL网络可以访问而无需重建整个网络系统。此外，利用层次网络设计的优势，我们在全球服务器上提出了一种新的标签驱动知识蒸馏（LKD）技术来解决第二个问题。与当前的知识蒸馏技术相反，LKD能够训练学生模型，该模型由所有教师模型的良好知识组成。因此，我们提出的算法可以有效地提取区域数据分布（即区域汇总模型）的知识，以减少客户在使用非独立分布数据的FL系统下操作时客户模型之间的差异。广泛的实验结果表明：（i）我们的F2L方法可以显着提高所有全球蒸馏的总体FL效率，并且（ii）F2L随着全球蒸馏阶段的发生而迅速达到收敛性，而不是在每个通信周期中提高。

translated by 谷歌翻译

Predicting Performances of Mutual Funds using Deep Learning and Ensemble Techniques

Nghia Chu , Binh Dao , Nga Pham , Huy Nguyen , Hien Tran

分类：机器学习

2022-09-18

预测基金绩效对投资者和基金经理都是有益的，但这是一项艰巨的任务。在本文中，我们测试了深度学习模型是否比传统统计技术更准确地预测基金绩效。基金绩效通常通过Sharpe比率进行评估，该比例代表了风险调整的绩效，以确保基金之间有意义的可比性。我们根据每月收益率数据序列数据计算了年度夏普比率，该数据的时间序列数据为600多个投资于美国上市大型股票的开放式共同基金投资。我们发现，经过现代贝叶斯优化训练的长期短期记忆（LSTM）和封闭式复发单元（GRUS）深度学习方法比传统统计量相比，预测基金的Sharpe比率更高。结合了LSTM和GRU的预测的合奏方法，可以实现所有模型的最佳性能。有证据表明，深度学习和结合能提供有希望的解决方案，以应对基金绩效预测的挑战。

translated by 谷歌翻译

vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM

Thanh Tin Nguyen , Long H. Nguyen , Nhat Truong Pham , Liu Tai Nguyen , Van Huong Do , Hai Nguyen , Ngoc Duy Nguyen

分类：计算机视觉 | 人工智能 | 自然语言处理

2022-09-03

这项研究介绍了我们对越南语言和语音处理任务（VLSP）挑战2021的文本处理任务的医疗保健领域的自动越南图像字幕的方法作为编码器的体系结构和长期的短期内存（LSTM）作为解码器生成句子。这些模型在不同的数据集中表现出色。我们提出的模型还具有编码器和一个解码器，但是我们在编码器中使用了SWIN变压器，LSTM与解码器中的注意模块结合在一起。该研究介绍了我们在比赛期间使用的培训实验和技术。我们的模型在vietcap4h数据集上达到了0.293的BLEU4分数，并且该分数在私人排行榜上排名3 $^{rd} $。我们的代码可以在\ url {https://git.io/jddjm}上找到。

translated by 谷歌翻译

Benchmark Results for Bookshelf Organization Problem as Mixed Integer Nonlinear Program with Mode Switch and Collision Avoidance

Xuan Lin , Gabriel I. Fernandez , Dennis W. Hong

分类：机器人

2022-08-28

混合整数凸面和非线性程序MICP和MINLP具有表现力，但需要长时间解决时间。结合了数据驱动方法的求解器启发式方法的最新工作表明，有可能克服此问题，从而可以在更大规模的实际问题上进行应用。为了通过数据驱动的方法在线求解混合企业双线性程序，存在几种配方，包括具有互补约束（MPCC），混合智能编程（MIP）的数学编程。在这项工作中，我们将这些数据驱动方案的性能基于具有离散模式开关和避免碰撞限制的书架组织问题的性能。将成功率，最佳成本和解决时间与非DATA驱动方法进行比较。我们提出的方法被证明是用于书架问题的机器人臂的高级计划者。

translated by 谷歌翻译

HTML版本

Understanding Long Documents with Different Position-Aware Attentions

Hai Pham , Guoxin Wang , Yijuan Lu , Dinei Florencio , Cha Zhang

分类：自然语言处理 | 人工智能

2022-08-17

尽管在文档理解方面取得了成功，但由于计算中的几个挑战以及如何有效吸收长期多模式输入，因此长期文档理解的实际任务在很大程度上尚未探索。大多数基于变压器的方法仅处理简短的文档，并且由于其过度的计算和内存限制，因此仅使用文本信息来引起注意。为了解决长期文档理解中的这些问题，我们探索了处理1D和新的2D位置引人入胜的不同方法，并以本质上的背景缩短了。实验结果表明，我们提出的模型基于各种评估指标具有此任务的优势。此外，我们的模型仅对注意力进行更改，因此很容易适应任何基于变压器的体系结构。

translated by 谷歌翻译

Multi-modal Transformer Path Prediction for Autonomous Vehicle

Chia Hong Tseng , Jie Zhang , Min-Te Sun , Kazuya Sakai , Wei-Shinn Ku

分类：机器人 | 计算机视觉

2022-08-15

关于车辆路径预测的推理是自动驾驶系统安全运行的必不可少的问题。有许多用于路径预测的研究工作。但是，其中大多数不使用车道信息，也不基于变压器体系结构。通过利用从配备自动驾驶车辆的传感器收集的不同类型的数据，我们提出了一个名为多模式变压器路径预测（MTPP）的路径预测系统，该系统旨在预测目标试剂的长期未来轨迹。为了实现更准确的路径预测，在我们的模型中采用了变压器体系结构。为了更好地利用车道信息，目标试剂不太可能采用与目标试剂相反的车道，因此被过滤掉。另外，将连续的车道块组合在一起，以确保车道输入足够长以进行路径预测。进行了广泛的评估，以显示使用Nuscene（现实世界中的轨迹预测数据集）的拟议系统的功效。

translated by 谷歌翻译